# Native Multimodal Pretraining

InternVL3 38B Instruct GGUF

InternVL3-38B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.

InternVL3 8B GGUF

InternVL3 is an advanced multimodal large language model series, demonstrating exceptional overall performance with robust multimodal perception and reasoning capabilities.

InternVL3 9B AWQ

InternVL3-9B is a multimodal large language model from the InternVL3 series, featuring exceptional multimodal perception and reasoning capabilities. It supports various application scenarios such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.

Transformers Other

InternVL3 8B AWQ

InternVL3-8B is an advanced multimodal large language model developed by OpenGVLab, with powerful multimodal perception and reasoning capabilities. It supports tool calling, GUI agents, industrial image analysis, 3D visual perception, and other emerging application areas.

Transformers Other

InternVL3 2B AWQ

InternVL3-2B is an advanced Multimodal Large Language Model (MLLM) developed by OpenGVLab, with exceptional multimodal perception and reasoning capabilities. It supports tool usage, GUI agents, industrial image analysis, 3D visual perception, and more.

Transformers Other

InternVL3 1B AWQ

InternVL3-1B is a multimodal large language model in the InternVL3 series, featuring exceptional multimodal perception and reasoning capabilities.

Transformers Other

InternVL3 2B Pretrained

InternVL3-2B is an advanced multimodal large language model developed by OpenGVLab, with robust vision-language understanding and reasoning capabilities. It supports a range of multimodal tasks.

Transformers Other

InternVL3 9B Instruct

InternVL3-9B-Instruct is a supervised fine-tuned model in the InternVL3 series, with powerful multimodal perception and reasoning capabilities. It supports multiple modalities, including images, text, and video.

Transformers Other

InternVL3 8B Instruct

InternVL3-8B-Instruct is an advanced Multimodal Large Language Model (MLLM) with exceptional multimodal perception and reasoning capabilities. It supports capabilities such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.

Transformers Other

InternVL3 2B Instruct

InternVL3-2B-Instruct is a supervised fine-tuned (SFT) version of InternVL3-2B. It has undergone native multimodal pretraining followed by SFT and offers powerful multimodal perception and reasoning capabilities.

Transformers Other

InternVL3 1B Instruct

InternVL3-1B-Instruct is a supervised fine-tuned model in the InternVL3 series, built on native multimodal pretraining, with exceptional multimodal perception and reasoning capabilities.

Transformers Other

InternVL3 78B Instruct

InternVL3-78B-Instruct is an advanced multimodal large language model developed by OpenGVLab, with exceptional multimodal perception and reasoning capabilities. It supports tasks such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.

Transformers Other

InternVL3-1B is a 1B-parameter multimodal large language model in the InternVL3 series, integrating the InternViT visual encoder and Qwen2.5 language model, with exceptional multimodal perception and reasoning capabilities.

Transformers Other

© 2025 AIbase